Toward Automatic Recognition of Japanese Broadcast News
نویسندگان
چکیده
In this paper we report on automatic recognition of Japanese broadcast-news speech. We have been working on largevocabulary continuous speech recognition (LVCSR) for Japanese newspaper speech transcription and achieved reasonably good performance. We have recently applied our LVCSR system to transcribing Japanese broadcast-news speech. We extended the vocabulary to 20k words and trained the language models using newspaper texts and broadcast-news manuscripts. These two language models were applied to our evaluation speech sets. The language model trained using broadcast-news manuscripts achieved better results for broadcast-news speech than the language model trained using newspaper texts which achieved better results for newspaper speech. In preliminary experiments on Japanese broadcastnews transcription, we achieved a word accuracy of 79.3% for anchor-speakers’ speech by using a language model trained using broadcast-news manuscripts and newspaper texts.
منابع مشابه
Toward automatic transcription of Japanese broadcast news
In this paper, we report on the automatic recognition of Japanese broadcast-news speech. We have been working on largevocabulary continuous speech recognition (LVCSR) for Japanese newspaper speech transcription and have achieved good performance. We have recently applied our LVCSR system to transcribing Japanese broadcast-news speech. We extended the vocabulary from 7k words to 20k words and tr...
متن کاملJapanese broadcast news transcription
In this paper, we describe the on-going development of a Japanese Broadcast News Transcription system at BBN Technologies. This is a collaboration between BBN and NHK to use automatic speech recognition technology to provide live closed caption for NHK’s TV news programs in Japan. We describe what the NHK Broadcast News Corpus comprises and how we adopted transcription technology developed for ...
متن کاملRecent advances in Japanese broadcast news transcription
In this paper, we report on language modeling and acoustic modeling studies for Japanese broadcast news speech recognition. We constructed a language model that reduces recognition errors by utilizing context-dependent readings of Japanese characters. We also introduced filled-pause modeling into the language model. To improve the model’s performance for a series of sentences spoken by one spea...
متن کاملNew Real-Time Closed-Captioning System for Japanese Broadcast News Programs
A new real-time closed-captioning system for Japanese broadcast news programs is described. The system is based on a hybrid automatic speech recognition system that switches input speech between the original program sound and the rephrased speech by a ”re-speaker”. It minimises the number of correction operators, generally to one or two, depending on the difficulties of the speech recognition, ...
متن کاملAutomatic Broadcast News Speech Summarization
Automatic Broadcast News Speech Summarization
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997